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ABSTRACT 

When a considerable number of mutations have no effects on fit- 
ness values, the fitness landscape is said neutral. In order to study 
the interplay between neutrality, which exists in many real-world 
applications, and performances of metaheuristics, it is useful to de- 
sign landscapes which make it possible to tune precisely neutral 
degree distribution. Even though many neutral landscape models 
have already been designed, none of them are general enough to 
create landscapes with specific neutral degree distributions. We 
propose three steps to design such landscapes: first using an al- 
gorithm we construct a landscape whose distribution roughly fits 
the target one, then we use a simulated annealing heuristic to bring 
closer the two distributions and finally we affect fitness values to 
each neutral network. Then using this new family of fitness land- 
scapes we are able to highlight the interplay between deceptiveness 
and neutrality. 

Categories and Subject Descriptors 

1.2.8 [Artificial Intelligence]: Problem Solving, Control Methods, 
and Search. 

General Terms 

Algorithms, performance, design, experimentation. 

Keywords 

Fitness landscapes, genetic algorithms, search, benchmark. 

1. INTRODUCTION 

The Adaptative Landscape metaphor introduced by S. Wright [1] 
has dominated the view of adaptive evolution: an uphill walk of a 
population on a mountainous fitness landscape in which it can get 
stuck on suboptimal peaks. Results from molecular evolution has 
changed this picture: Kimura's model [2] assumes that the over- 
whelming majority of mutations are either effectively neutral or 
lethal and in the latter case purged by negative selection. This as- 
sumption is called the neutral hypothesis. Under this hypothesis, 
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dynamics of populations evolving on such neutral landscapes are 
different from those on adaptive landscapes: they are characterized 
by long periods of fitness stasis (population stated on a 'neutral 
network') punctuated by shorter periods of innovation with rapid 
fitness increases [3]. In the field of evolutionary computation, neu- 
trality plays an important role in real- world problems: in design 
of digital circuits [4] [5] [6], in evolutionary robotics [7] [8]. In 
those problems, neutrality is implicitly embedded in the genotype 
to phenotype mapping. 

1.1 Neutrality 

We recall a few fundamental concepts about fitness landscapes 
and neutrality (see [9] for a more detailed treatment). A landscape 
is a triplet (S, V, f) where S is a set of potential solutions i.e. a 
search space, V : S — ■> 2 s , a neighbourhood structure, is a func- 
tion that assigns to every s € 5 a set of neighbours V(s), and 
/ : S — * M, is a fitness function that can be pictured as the "height" 
of the corresponding potential solutions. The neighbourhood is of- 
ten defined by an operator like bitflip mutation. A neutral neigh- 
bour of s is a neighbour with the same fitness f(s). The neutral 
degree of a solution is the number of its neutral neighbours. A 
fitness landscape is neutral if there are many solutions with high 
neutral degree. A neutral network, denoted NN, is a connected 
graph where vertices are solutions with the same fitness value and 
two vertices are connected if they are neutral neighbours. 

1.2 Fitness Landscapes with Neutrality 

In order to study the relationship between neutrality, dynamics of 
Evolutionary Algorithms (EA) and search difficulty, some bench- 
marks of neutral landscapes have been proposed. More often neu- 
trality is either an add-on feature, as in NK-landscapes, or an inci- 
dental property, as in Royal-Road functions. In most cases the de- 
sign acts upon the amount of solutions with the same fitness. Royal- 
Road functions [10] are defined on binary strings of length N = 
n.k where n is the number of blocks and k the size of one block. 
The fitness function corresponds to the number of blocks which are 
set with k bits value 1 and neutrality increases with k. Numerous 
landscapes are variant of NK-Landscapes [11]. The fitness function 
of an N ^-landscape is a function / : {0, 1}^ -> [0, 1) defined 
on binary strings with N bits. An 'atom' with fixed epistasis level 
is represented by a fitness component ft : {0, 1} — > [0, 1) as- 
sociated to each bit i. It depends on the value at bit i and also on 
the values at K other epistatic bits. The fitness / is the average 
of the values of the TV fitness components ft, Several variants of 

A'-landscapes try to reduce the number of fitness values in or- 
der to add some neutrality. In AA'p-landscapes [12], fi(x) has a 
probability p to be equal to ; in NK g-landscapes [13], ft(x) is 
uniformly distributed in the interval [0, q — 1] nlN; in Technological 



Landscapes [14], tuned by a natural number M, f(x) is rounded 
so that it can only take M different values. For all those problems, 
neutrality is tuned by one parameter only: neutrality increases ac- 
cording to p and decreases with q or M (see for example Figure 
2). 

Dynamics of a population on a neutral network are complex, 
even on flat landscapes as shown by Derrida [15]. The works [16] 
[17] [18] [19], at the interplay of molecular evolution and optimiza- 
tion, study the convergence of a population on neutral networks. In 
the case of infinite population under mutation and selection, they 
show distribution on a AW is only determined by the topology of 
this network. That is to say, the population converges to the solu- 
tions in the NN with high neutral degree. Thus, the neutral degree 
distribution is an important feature of neutral landscapes. 

In order to study more precisely neutrality, for instance link be- 
tween neutrality and search difficulty, we need for "neutrality-driven 
design" where neutrality really guides the design process. In this 
paper we propose to generate a family of landscapes where it is 
possible to tune accurately the neutral degree of solutions. 

2. ND-LANDSCAPES 

In this section, we first present an algorithm to create a land- 
scape with a given neutral degree distribution. Then we will refine 
the method to obtain more accurate landscapes and finally we will 
study time and space complexity of the algorithm. 

2.1 An algorithm to design 
small ND-Landscape 

We now introduce a simple model of neutral landscapes called 
ND-Landscapes where N refers to the number of bits of a solution 
and D to the neutral degree distribution. In this first step our aim is 
to provide an exhaustive definition of the landscape assigning one 
fitness value to each solution. We fix N to 16 bits and so the size of 
search space is 2 16 . Building a ND-Landscape is done by splitting 
the search space into neutral networks. However the fitness value 
of each neutral network has no influence on the neutrality. This is 
why these fitness values are randomly chosen. 

Let D be an array of size N+l representing a neutral degree dis- 
tribution. N and D are given as inputs and the algorithm (see algo- 
rithm 1) returns a fitness function f from {0, 1}^ to 1R such that 
the neutral degree distribution of the fitness landscape is similar to 
D. For more simplicity, we chose to give a different fitness value 
to each neutral network. We define Roulette Wheel(D) as a random 
variable whose density is given by distribution D. It is directly in- 
spired from the genetic algorithm selection operator. For example: 
let A be the following distribution : 

A[0]=0 A[l]=0.25 A[2]=0.5 A[3]=0.25. RouletteWheel(A) 
will return value 1 in 25 % of the time, 2 in 50 % of the time and 
3 in 25 % of the time. Figure 1 shows the neutral networks of an 
ideal ND-Landscape (size=2 5 ) for the distribution A. 

2.2 A metaheuristic to improve the ND design 

Using algorithm 1, exhaustive fitness allocation does not create 
a landscape with a neutral degree distribution close enough to the 
input distribution. The reason is the fitness function is completely 
defined before the neutral degree of every solution has been con- 
sidered. Hence, we use a simulated annealing metaheuristic to im- 
prove the landscape created by algorithm 1. Here, simulated an- 
nealing is not used to find a good solution of a ND-Landscape but 
to adjust the landscape by modifying the fitness of some solutions 
such as neutral distribution of a ND-Landscape be closer to the in- 
put distribution. The local operator is the changement of fitness 
value of one solution of the landscape, which can alter at most N+l 




Figure 1: Example of a tiny ND-Landscape. Each node repre- 
sents a solution and two nodes are connected if they have the 
same fitness value and are Hamming neighbours. In this exam- 
ple there are five neutral networks. 



Algorithm 1 Generation of ND-Landscapes 



Vs e S, f[s] <- unaffected 
randomly choose one solution so. 
CandidatesList <— S sorted by distance from so. 
while not empty(CandidatesList) do 
s <— hea.d(CandidatesList) 
for d = to N do 

if s can't have d neutral neighbours 
thenD'[d]^0 
elseD'fd] <- D[d] 
end for 

n <- RouletteWheel(D'[d]) 

Give a value to some unaffected neighbours so that s has ex- 
actly n neutral neighbours and so that the neutral degrees of 
already chosen solutions (^ CandidatesList) are unchanged. 
D[n] «- D[n] - ^ 

CandidatesList <— next(CandidatesList) 
end while 
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Figure 2: Neutral degree distribution for some neutral landscapes 



neutral degrees. The acceptance of a transition is determined by the 
difference between the distance to the input distribution before and 
after this transition. The distance we use to compare two distribu- 
tions is the root mean square : 



dist(D,D ) 



Simulated annealing appeared to be a fast and efficient method 
for this particular task. Improvements made by simulated annealing 
are shown in figure 3. Easiest neutral distributions to obtain seemed 
to be smooth ones like gaussian distributions (which are the most 
encountered when dealing with real or artificial neutral problems). 
On the other hand, sharp distributions (like the middle-right one of 
figure 3) are really hard to approach. In addition, independently 
of the shape, distributions with higher average neutral degree are 
harder to approximate. 

2.3 Space and Time complexity 

To create a landscape with a search space of size 2^, we use an 
array of size 2 N containing fitness values and a list of forbidden 
values for each solution. Thus we need a memory space of size 
0(2 N x N). Consequently the space complexity is : 0(2 N x N) 

In order to know what are the possible neutral degrees of an un- 
affected solution s, we must consider every interesting value for s 
(the fitnesses of all neighbour solutions and a random value), and 
for each of these values, we must find out all possible neutral de- 
grees. This can be done in a time 0(N ). We evaluate the possible 
neutral degrees once for each solution. Time allowed for simulated 
annealing is proportional to the time elapsed during construction. 
Thus, the time complexity of the algorithm is 0(2 N N 2 ). 



Consequently we can only construct ND-Landscapes with a small 
N (< 16) but we will see in section 4 how to create Additive Ex- 
tended ND-Landscapes with far greater search spaces. 

2.4 Sizes of the generated Neutral Networks 

Figure 4 shows the diversity of sizes of neutral networks for 4 
distributions. For every distribution we created 50 different ND- 
Landscapes. Graphics on the left show the input and the mean re- 
sulting distribution. Graphics on the right show all of the networks 
of these landscapes sorted by decreasing size with a logarithmic 
scale. We clearly see that the neutral degree distribution is a really 
determining parameter for the structure of the generated landscape. 

3. TUNING DECEPTIVENESS 
OF ND-LANDSCAPES 

Once we have generated a landscape with a specific neutral de- 
gree distribution, we can change the fitness value of all neutral net- 
works without changing the neutral degree distribution (as long as 
we do not give the same fitness to two adjacent networks). Hence, 
for a given neutral distribution, we can tune the difficulty of a ND- 
Landscape. For instance if each NN have a random fitness value 
from [0, 1] then the landscape is very hard to optimize. Here, we 
will use the well known Trap Functions [20] to affect fitnesses to 
NN in order to obtain a ND-Landscape with tunable deceptive- 
ness. 

The trap functions are defined from the distance to one particular 
solution. They admit two optima, a global one and a local one. 
They are parametrized by two values b and r. The first one, b allows 
to set the width of the attractive basin for each optima, and r sets 
theirs relative importance. The function fr : {0, 1}^ — > H is so 
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Figure 3: Neutral Degree Distributions obtained by algorithm 1 (Di) and then adjusted by simulated annealing (D 2 ). Neutral degrees 
are on abcissa. Impulses represent the target distributions (Do). 



defined by: 



h(x) = 



1.0 - 

r(d(x) 



if d(x)<b, 



elsewhere 



where d(x) is the Hamming distance to the global optimum, di- 
vided by N, between x and one particular solution. The problem is 
most deceptive as r is low and b is high. In our experiment, we will 
use two kinds of Trap functions with r = 0.9, one with b = 0.25 
and another one with b — 0.75 (see figure 5 (a) and (b)). 

To affect a fitness value to each neutral network, we first choose 
the optimum neutral network, denoted NN opt , (for example the 
one containing the solution 0^) and set its fitness to the maximal 
value 1.0. Then, for each neutral network, we compute the distance 
d between its centroid 1 and the centroid of NN opt ; finally the fit- 



'The centroid of a NN is the string of the frequency of appearance 



ness value of the NN is set according to a trap function 2 and the 
distance d. In order to ensure that all adjacent networks have dif- 
ferent fitness values, it is possible to add a white noise to the fitness 
values of each NN. In the following experiments, the length of 
bitstring is iV = 16. ND-landscapes are constructed with uniform 
neutral degree distributions. We use the distributions defined by 



1/w if i £ {p,p + w — 1}, 
elsewhere 



where p £ {0, 7}, w G {3, 4}, and the two Trap functions de- 
fined in figure 5. For each distribution and each Trap function, 30 
landscapes were generated. 



of bit value 1 at each position. 

2 This trap function is defined for all real numbers between and N 
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Figure 4: Neutral networks sizes for various ND-landscapes 

where: 



3.1 Fitness Distance Correlation 
of ND-Landscapes 

To estimate the difficulty to search in these landscapes we will 
use a measure introduced by Jones [21] called fitness distance cor- 
relation (FDC). Given a set F = {/i, fe, f m } of m individual 
fitness values and a corresponding set D = {di, d,2, d m } of the 
m distances to the global optimum, FDC is defined as: 



FDC = 



C F 



1 m 

C FD = -J2(fi - f)(<k - d) 



is the covariance of F and D and of, ctd, f and d are the standard 
deviations and averages of F and D. Thus, by definition, FDC 
stands in the range [—1,1]. As we hope that fitness increases as 
distance to global optimum decreases, we expect that, with an ideal 
fitness function, FDC will assume the value of — 1. According to 
Jones [21], problems can be classified in three classes, depending 
on the value of the FDC coefficient: 
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Figure 5: Two trap functions: Deceptive one with b — 0.25 and 
r = 0.9 (a), Easy one with b = 0.75 and r = 0.9 (b) 

• Deceptive problems (FDC > 0.15), in which fitness in- 
creases with distance to optimum. 

• Hard problems (—0.15 < FDC < 0.15) in which there is 
no correlation between fitness and distance. 

• Easy problems (FDC < —0.15) in which fitness increases 
as the global optimum approaches. 

Hard problems are in fact hard to predict, since in this case, the 
FDC brings little information. The threshold interval [—0.15, 0.15] 
has been empirically determined by Jones. When FDC does not 
give a clear indication i.e., in the interval [—0.15,0.15], examin- 
ing the scatterplot of fitness versus distance can be useful. The 
FDC has been criticized on the grounds that counterexamples can 
be constructed for which the measure gives wrong results [22, 23, 
24]. 

Figure 6 shows the average and standard deviation of FDC over 
ND-Landscapes for each set of parameters, neutral distribution and 
deceptiveness. The absolute value of FDC decreases as we gener- 
ate ND-Landscapes with more and more neutrality. When adding 
neutrality, the landscapes are increasingly flatter and thus less easy 
or deceptive. So neutrality smoothes correlation. Adding neutrality 
to a deceptive landscape makes it easier and adding neutrality to a 
easy landscape makes it harder. 

3.2 Genetic Algorithm Performances on ND- 
Landscapes 

In this section, difficulty is measured by genetic algorithm per- 
formances which is the success rate over 10 independent runs. In 
order to minimize the influence of the random creation of ND- 
Landscapes, we consider 30 different landscapes for each distri- 
bution D and each trap function. For the GA, one-bit mutation 
and one-point crossover are used with rates of respectively 0.8 and 
0.2. The evolution, without elitism and with 3-tournament selec- 
tion of a population of 50 individuals took place during 50 gener- 
ations. Figure 8 shows average and standard deviation of GA per- 
formances over the 30 ND-landscapes. Until the average neutral 
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Figure 6: Average and standard deviation of FDC as a function 
of average neutral degree for ND-landscapes created from easy 
and deceptive Trap functions. 

degree 5, the landscapes are fully deceptive (a) or fully easy (b). 
Between neutral degree 5 and 7, the deceptiveness and the easiness 
decrease. After neutral degree 7, the two trap functions have nearly 
a same good success rate (0.7). Whereas the FDC coefficient gave 
no information about performances, examining the scatter plot fit- 
ness/distance allowed to predict these good performances (figure 
7 (c)). When neutrality increases on deceptive trap, performances 
increase whereas on easy trap they decrease. These results confirm 
conclusions found in section 3.1 from FDC measures. 

4. ADDITIVE EXTENDED 
ND-LANDSCAPES 

Exhaustive fitness allocation allows only to generate landscapes 
with small search space (in our experiments 2 16 solutions). Hence 
we must find a way to construct similar problems on a larger scale. 
We propose here to concatenate several small ND-Landscapes to 
create an additive ND-Landscape. Even though we are not able 
to create an additive ND-Landscape directly from a neutral degree 
distribution we can chose the mean, the standard deviation and less 
precisely the shape of his neutral degree distribution. Moreover, 
this method allows us to know exactly the neutral degree distribu- 
tion of the resulting landscape. Let be Pi = (Ei,Vi, fi) and 
Pi = (E 2 , V 2 , f 2 ) two fitness landscapes. We define the extended 
landscape P = PiffiP 2 = (E, V, f) such as : 

• E = Ei x E 2 

. V(iri,i/i) e E%,V(X2,V2) 6 El 

(xi,x 2 ) £ V(yi,y 2 ) -<=> (xi £ Vi(yi) and x 2 = 
y 2 ) or (xi = yi and x 2 £ V 2 (y 2 )) 

• f(xi,xz) = fi(xi) + f 2 (x 2 ) 

The size of the larger ND-landscape will be the product of the sizes 
of the small ones. The neutral degree distribution of the resulting 
landscape is the convolution product of the two components distri- 
butions. The convolution product of two distributions Di and D 2 
is the distribution D (see figure 9) such as 

n 

Vn £ JN,D(n) = ^(£>i(i) x D 2 {n-i)) 

i=0 

We have the following properties : 
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Figure 7: ND-Landscapes : FDC scatter plot from easy trap (cf 
Figure 5 a) with respective average neutral degree (a) 1.5 (b) 6 
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Figure 8: Average with standard deviation of success rate of 
a GA on ND-landscapes created from easy and deceptive trap 
functions 
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Figure 9: Example of convolution product 



average(D) = average(Di) + average(Di) 
and cr(-D) = \/ct 2 (Di) + ~oHD-i) where a is the standard de- 
viation. 

The convolution product of two normal (resp. x 2 , Poisson) dis- 
tributions is a normal (resp. \ 2 , Poisson) distribution. 

Concatenation is commutative, so it is simple to concatenate a 
number of small landscapes. Hence, we can tack together many 
small ND-Landscapes generated exhaustively to obtain a bigger 
one with known neutral degree distribution. Moreover, it has been 
proven by Jones [21] that the FDC coefficient is unchanged when 
multiple copies of a problem are concatenated to form a larger 
problem. 

Conclusion 

This paper presents the family of ND-Landscapes as a model of 
neutral fitness landscapes. Most of the academic fitness landscapes 



found in the literature deals with neutrality, either as an add-on fea- 
ture or as an incidental property. Theoretical studies in molecular 
evolution, as well as in optimization, show that the distribution of 
neutral degree is a key factor in the evolution of population on a 
neutral network. Hence to understand and study the influence of 
neutrality on evolutionary search process it may be helpful to de- 
sign landscapes with a given neutral distribution. We propose three 
steps to design such landscapes: first using an algorithm we con- 
struct a landscape whose distribution roughly fits the target one, 
then we use a simulated annealing heuristic to bring closer the two 
distributions, and finally we affect fitness values to each neutral 
network. The last step can be used to tune the landscape diffi- 
culty according to other criteria than neutrality. So we can study 
interaction between neutrality and another source of hardness. In 
this paper we use trap functions to affect fitness values, this allows 
us to study the interplay between deceptiveness and neutrality; in 
particular, experimental results show that neutrality smoothes the 
correlation between fitness and distance. For some problems, intro- 
ducing neutrality could be benefic as shown in [25] by increasing 
the evolvability. On the other hand, neutrality could destroy useful 
information such as correlation as it is shown in our experiments. 
As these landscapes need an exhaustive enumeration of the search 
space we propose to concatenate small ND-Landscapes to scale up 
over 16 bits. Then, using the convolution product of distribution, 
we are able to design large landscapes with known neutral distribu- 
tion. 

Future work along these lines includes studying the influence of 
various distributions and different ways to affect fitness value on the 
dynamics of a population on a neutral network. With ND-trap func- 
tions we have focused on the interplay between deceptiveness and 
neutrality. By replacing trap functions by NK functions[ll], we 
might be able to highlight correlation between epistasis and neu- 
trality. 
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